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Abstract. We study logics defined in terms of second-order monadic monoidal and 
groupoidal quantifiers. These are generalized quantifiers defined by monoid and groupoid 
word-problems, equivalently, by regular and context-free languages. We give a computa- 
tional classification of the expressive power of these logics over strings with varying built-in 
predicates. In particular, we show that ATIME(n) can be logically characterized in terms 
of second-order monadic monoidal quantifiers. 



We study logics defined in terms of so-called second-order monadic monoidal and groupoidal 
quantifiers. These are generalized quantifiers defined by monoid and groupoid word-problems, 
equivalently, by regular and context-free languages. A groupoid is a finite multiplication 
table with an identity element. For a fixed groupoid G, each S C G defines a G-word- 
problem, i.e., a language W{S,G) composed of all words w, over the alphabet G, that can 
be bracketed in such a way that w multiplies out to an element of S. The word-problem 
of a monoid, i.e., an associative groupoid, is defined analogously. Groupoid word-problems 
relate to context-free languages in the same way as monoid word-problems relate to regular 
languages: every such word-problem is context-free, and every context-free language is a 
homomorphic pre-image of a groupoid word-problem (this result is credited to Valiant in 
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In descriptive complexity, (first-order) monoidal quantifiers have been studied exten- 
sively in connection to the complexity class NC^ and its sub-classes (see [H [3l [231 l25]). 
However, in order to define non-regular languages in terms of monoidal quantifiers, some 
built-in relations, in addition to <, need to be assumed. It was shown already in that 
first-order logic with unnested unary monoidal quantifiers characterizes the class of regular 
languages, REG, over strings without auxiliary built-in relations. This characterization of 
REG was generalized in [16j to allow also non-unary monoidal quantifiers, even with ar- 
bitrary nestings. In [11], the same was shown to hold for second-order monadic monoidal 
quantifiers: 

mon-Qj^onFO = SOM(mon-Qj^„„) = REG = 3S0M. (1.1) 
In (II. ip . 3S0M denotes existential second-order monadic logic and the logic mon-Q^^o^FO 
consists of all formulas in which a monadic second-order monoidal quantifier Qj^ is applied 
to an appropriate tuple of FO-formulas without further occurrences of second-order quanti- 
fiers. On the other hand, in SOM(mon-Qj^^jj) arbitrary nestings of monoidal quantifiers are 
allowed. Here a crucial assumption is that there are no auxiliary built-in relations, besides 
the order, since already SOM(-|-), i.e., second-order monadic logic with built-in addition, 
defines exactly the languages in the linear fragment of the polynomial hierarchy |20j . 

We see that with monoidal quantifiers the situation is clear-cut, i.e., formulas with 
monadic second-order monoidal quantifiers cannot define non-regular languages. On the 
other hand, over strings with built-in arithmetic (i.e., built-in -|- and x) the classes in (II. ip 
are presumably not equal, e.g., 3S0M C NP and already in m.on-Qlj^^^FO{+, x) PSPACE- 
complete languages can be defined as we show below in Corollary 15.21 

In [5], the elaborate theory connecting monoids to the fine structure of NC^ was gen- 
eralized to groupoids and LOGCFL. It was shown in [5] that there exists a single groupoid 
whose word-problem is complete for LOGCFL under DLOGTIME-reductions, implying 
also a logical characterization for LOGCFL in terms of first-order groupoidal quantifiers. 
Building on this result, a systematic investigation of first-order groupoidal quantifiers was 
initiated in [16] . 

In [TT] it was asked what is the relationship of the corresponding (second-order) logics 
if monoidal quantifiers are replaced by groupoidal quantifiers in (jl.ip . Here we address this 
question and show the following (see Corollary 13. 4p : 

mon-Q^,pFO(+, x) = SOM(mon-Q^,p). (1.2) 

It is interesting to note that for groupoidal quantifiers we have a similar collapse result as 
for monoidal quantifiers, but this time assuming built-in arithmetic on the left. Note that, 
over ordered structures, the relations -|- and x are definable in the logic SOM(mon-(5Qfp) 
(see [1] and |16j). It is an open question whether the built-in relations -|- and x are really 
needed for the equivalence in (II. 2p to hold. 

In the literature, second-order monadic quantifiers have been studied under two slightly 
different semantics (for each L, quantifiers and Q^)- We will show that the analogue of 
()1.2p for the alternative semantics Q'^ remains valid even if we drop the built-in predicates 
-|- and X from mon-(5Qj.pFO(-|-, x), i.e., 

mon-Q^rpFO = SOM(mon-Q^,p). (1.3) 

Since the logics in (II. 2p and ()1.3p are all equivalent (see Corollary 13. 4p , it follows that the 
only remaining open question regarding the equivalences between logics with groupoidal 
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quantifiers is whether 

mon-Q^,pFO = mon-Q^,pFO? 
This question is directly concerned with the problem of pinning down the exact expressive 
power of the so-called finite leaf automata with context-free leaf languages (see Theorem 
\Tm and Corohary [2TlD . 

In this paper we aim for a concise classification of the expressive power of the logics 
with second-order monadic monoidal and groupoidal quantifiers. We first note that the 
difference between the two semantics, i.e., and Q^, disappears assuming built-in arith- 
metic. This already simplifies the picture considerably. However, especially in the monoidal 
case, the expressive power of the quantifiers without built-in arithmetic remains open. 
For groupoidal quantifiers, we find that 

mon-Q^,pFO = SOM(mon-Q^,p) = 2L0GCFL^ 

where 2^*-'^^^'^ equals the class of languages whose tally version resides in LOGCFL. For 
monoidal quantifiers, we show that 

SOM(mon-Q^„n, +, x) = ATIME(n). 

Table [T] below contains a summary of our complexity results. 



2. Preliminaries 

We follow standard notation for second-order monadic logic with linear order, see, e.g., |24j . 
We mainly restrict our attention to string structures, i.e., structures of string signatures 
T = (Pqi , ■ ■ ■ , Pas ) 1 where all the predicates Pa^ are unary. We assume that the universe 
dom(yi) of each structure A is of the form {0, . . . ,n — 1} and that the logic's linear order 
symbol refers to the numerical order on {0, . . . , n — 1}. We restrict attention to structures A 
in which the interpretations P^. of the predicates Pq- satisfy the following: -P^nP^ = 0, for 
i 7^ j, and Ui<j<si-'^ = dom(yi). Such r-structures correspond to strings over the alphabet 
{ai, . . . , as} in the usual way. 

An alphabet S is a finite set of symbols. For technical reasons to be motivated shortly, 
we assume that every alphabet has a built-in linear order, and, to indicate that order, we 
write alphabets as sequences of symbols, e.g., in the above case we write (ai, . . . , a^). The 
set of all finite S-strings is denoted by S* and S"*" = S* \ {e}, where e is the empty string. 
For a string w, \w\ denotes the length of w and \w\a the number of occurrences of the letter 
avaw. The concatenation of the strings w and w' is denoted by w'^w' , and denotes the 
string bi- ■ - bk, where bi = a for 1 < i < k. For L C S* and e € E, the letter e is a neutral 
letter of L if for all u,v € S*, we have uv (z L -4=^ uev G L. The class of languages that 
have a neutral letter is denoted by !N. 

For a signature r = {Pai, ■ ■ ■ , Pas)i the first-order r- formulas, FO[r], are built from first- 
order variables in the usual way, using the Boolean connectives {A, V,-!}, the predicates 
Pai together with {=, <}, the constants min and max, the first-order quantifiers {3, V}, and 
parentheses. SOM[r] extends FO[t] in terms of unary second-order variables and second- 
order quantifiers {3,V}. (The letters SOM stand for second order monadic logic; in the 
literature, this logic is sometimes denoted by MSG.) 

For a complexity class C and logics L and we write £ < £' if for every string 
signature r (unless otherwise specified), and every sentence ip £ L[t] there is an equivalent 
sentence tp E -^'[t]. Analogously, we write £ < C if the class of languages, over any 
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alphabet, which can be defined in L is contained in C. We write £ = = C) if £ < 

and < C and C < £). It is known [19j that FO is equal to the class of star-free 

regular languages and that SOM = REG, where REG is the class of regular languages (see 
[HI El [26]). 

Sometimes we assume that our structures (and logics) are equipped with auxiliary built- 
in predicates in addition to <, e.g., the ternary predicates + and x. The predicates + and 
X are defined as 

+ {i,j,k) ^ i + j = k, 

x{ij,k) ^ ixj = k. 

The predicate BIT is a further important predicate which is defined by: BIT(a,j) holds 
iff the bit with weight 2^ is 1 in the binary representation of a. The presence of built-in 
predicates is signalled, e.g., by the notation F0(+, x) and FO{Ql,+, x). It is well known 
that F0(+, x) = FO(BIT) (see fW^). In fact, it was shown in [TOl that BIT alone can 
define the corresponding canonical ordering, i.e., the symbol < can dropped from FO(BIT) 
without a loss in expressive power. 

2.1. Generalized quantifiers. Next, we extend logics in terms of generalized quantifiers. 
The Lindstrom quantifiers of Definition 12.11 are precisely what has been referred to as 
"Lindstrom quantifiers on strings" [9|. The original more general definition |17j uses trans- 
formations to arbitrary structures, not necessarily of string signature. 

Definition 2.1. Consider a language L over an alphabet S = (oi, 02, . . . , a^). Such a 
language gives rise to a Lindstrom quantifier Ql, that may be applied to any sequence of 
s — 1 formulas as follows: 

Let X be a /c-tuple of pairwise distinct variables. Let A he a structure and dom(yi) = 
{0, 1, . . . , n — 1}. We assume the lexicographic ordering on {0, 1, . . . , n — 1}^, and we write 
^(0) < ^(1) < . . . < for the sequence of potential values taken on by x. The /c-ary 

Lindstrom quantifier Ql binding x takes a meaning if s — 1 formulas, each having as free 
variables the variables in x (and possibly others), are available. Let (pi{x), if2{x), 
ips -lix) be these s — 1 formulas. Then 

A 1= QLx[ipi{x),ip2{x), . . .,tps^i{x)] 

iff the word of length n'^ whose ith. letter, < i < n'^' — 1, is 

r ai ifyihv'i(x»), 

I 02 ifyih-¥'i(a;(*^)Av52(^«), 

[as if yi h -V^i (x(*) ) A -(^2 (x(*) ) A • • • A ^^s-iix'-'^ ) , 

belongs to L. 

As an example, take s = 2 and consider L3 := 0*1(0 + 1)*; then Ql^ is the usual 
first-order existential quantifier. Similarly, the universal quantifier can be expressed using 
the language Ly := !*• Finally, for p > 1 and Lmodp = {w & {0, 1}* | \w\i = (mod p)}, 
the quantifiers Ql^^^p are known as modular counting quantifiers |24j . 

Definition 2.2. Let r be a signature, L a language over an alphabet S = (01,02, . . . ,as), 
and C a class of languages. 
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• The set of r-formulas, Q_lFO[t], of the logic QlFO consists of all formulas of the form 

QlX [fl {x) ,Lp2{x),. . . ,Lps_i{x)\, 

where, for some k^xisa, /c-tuple of pairwise distinct variables, and ^^^(x) is a FO[T]-formula 
for 1 < i < s — 1. 

• The set of r-formulas, ¥0{Qi)[t\, of the logic F0((5l) is defined by extending the formula 
formation rules of FO by the following clause: if, for some /c, x is a /c-tuple of pairwise 
distinct variables, and ifiijc) is a formula for 1 < « < s — 1, then 

is a formula, too. 

• Define the sets of r-formulas of the logics QqFO and FO(Qe) by 

geFO[r] := IJQlFOM, 

Lee 

FO(Qe)[T] := [JfO{Ql)[t]. 

Lee 

In this article we are especially interested in quantifiers defined by monoid and groupoid 
word-problems. 

Definition 2.3. A groupoidal quantifier (a monoidal quantifier) is a Lindstrom quantifier 
Ql where L is a word-problem of some finite groupoid (monoid). The usage of groupoidal 
quantifiers and monoidal quantifiers in our logical language is signalled by the subscripts 
Grp and Mon, respectively. We define 

QcrpFO := QeFO FO(QGrp) := FO(Qe) 
QMonFO := Qe'FO FO(gMon) := FO(Qe'), 
where C (C) is the class of all word-problems of finite groupoids (monoids). 

Second-order Lindstrom quantifiers on strings were introduced in [9]. Here, we are 
mainly interested in those binding only set variables (i.e., unary relations), so-called monadic 
quantifiers. For each language L, we define two monadic quantifiers Q\ and Q\ with slightly 
different interpretations. It turns out that the interpretation Q\, which was used in is 
natural in the context of finite leaf automata. On the other hand, the quantifier Q*^ is the 
exact second-order analogue of the corresponding first-order quantifier Qi. 

Definition 2.4. Consider a language L over an alphabet S = (ai, 02, . . . , a^). Let X = 
(Xi, . . . ,Xfc) be a fc-tuple of pairwise distinct unary second-order variables and let >l be a 
structure with dom(yi) = {0, 1, . . . , n — 1}. There are 2"'^ different instances (assignments) 
of X over A. We assume the following ordering on those instances: Let each instance of a 
single Xi be encoded by the bit string Sg • • • s\^_i with the meaning s*- = 1 j ^ X-i. 

Then 

(1) we encode an instance of X by the bit string 

■^O'So ■ ■ ■ ■^O'^i'^i ■ ■ ■ s\ - ■ ■ s\_is1^_i ■ ■ ■ s^_i 

and order the instances lexicographically by their codes. 

(2) we encode an instance of X by the bit string 

'^O'^l ■ ■ ■ Sn-l'^O'^l ■ ■ ■ s'ii-l ■ ■ ■ Sq'Si • • • s^-i 

and order the instances lexicographically by their codes. 
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The monadic second-order Lindstrom quantifier Q\ (respectively Ql) binding X takes a 
meaning if s — 1 formulas, each having free variables X, are available. Let ipi{X), ip2{X), 
. . . , ips-i{X) be these s — 1 formulas. Then 

A [= QiX[^i(X), ^2(X), . . . , ^.-i(X)] 

iff the word of length 2^^ whose ith letter, < i < 2^^ — 1, is 

' ai -liA^ifiiX^^), 
as i^A^^^l{X^'^)^^2{X^'^), 

^ as if N -9^1 (^^'^) A -<^2(X^'^) A • • • A -<^,_i(X^'^), 

(0) (2) (2"'' — 1) 

belongs to L. Above, X < X < ■ ■ ■ < X denotes the sequence of all instances 

ordered as in ([1]). The notation Q*^ is used when the instances are ordered according to ([2]). 

Again, taking as examples the languages L3 and Ly, we obtain the usual second-order 
monadic existential and universal quantifiers. Note that for L G {L^,L\/} the quantifiers 
Q\ and Q\ are "equivalent". This is due to the fact that, for the membership in L, the 
order of the letters in a word does not matter. 

Definition 2.5. Let r be a signature, L a language over an alphabet S = (oi, 02, . . . , 
and C a class of languages. 

• The set of r-formulas, mon-Q/^FO[T], of the logic mon-Q^FO consists of all formulas of 
the form 

Q\X[pi{X), v^2(X), . . . , ^s-iiX)] , (2.1) 

where, for some k, X is a fc-tuple of pairwise distinct unary second-order variables, and 
(Pi{X) is a FO[r]-formula with variables X , for 1 < i < s — 1. 

• The set of r-formulas, SOM(mon-(5^)[T], of the logic SOM(mon-(5]^) is defined by ex- 
tending the formula formation rules of SOM[r] by the following clause: if, for some k, X 
is a fc-tuple of pairwise distinct unary second-order variables, and (pi{X) is a formula for 
1 < i < s — 1, then 

QiX[v9i(X),(^2(X),...,<^,_i(X)] 

is a formula, too. 

• Define the sets of r-formulas of the logics mon-QgFO and SOM(mon-(5e)[T] by 

mon-Q^FO[r] := |J mon-QiFO[r], 
Lee 

SOM(mon-Q^)[r] := |J SOM(mon-Qi)[r]. 

Lee 

• The logics mon-Q|^FO, SOM(mon-(5|^), mon-QgFO, and SOM(mon-Qg) are defined anal- 
ogously by replacing Q\ everywhere with Q^. 

Analogously to the first-order case (see Definition 12. 3p . we use the subscripts Grp and 
Mon to indicate that all groupoidal quantifiers or monoidal quantifiers are available in the 
corresponding logic, e.g., SOM(mon-(5Qj.p) := SOM(mon-(2g), where C is the class of all 
word-problems of finite groupoids. 
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The next proposition shows that the difference between the two semantics of second- 
order monadic quantifiers disappears in the presence of built-in arithmetic (or if the arith- 
metic predicates are definable). Below, we write V''^ for the relation defined by the formula 
^ in a structure A, i.e., ii ip has k free variables, then 

V'^ = {a G dom(yi)'= I A ^ il^{a)}. 

Lemma 2.6. Let Xi, . . . , X^ be unary second-order variables. There are F0(+, x)-formulas 
(f)i{x, X), . . . , cj)k{x, X) and '4)i{x.,X)., . . . ,il)k{x.,X) such that for all A andAi,...,Ak C 
dom(yi) = {0, 1, . . . ,n — 1}, (where Ai is encoded by Sq - ■ ■ as in Definition \2.4\ ) it holds 

that the encoding of {cl)^^''^\ . . . , (j)^^''^'^) as a bit string as in clausell\ of Definition \2.4\ results 
with 

^0^1 ' ' ' ^n-l^O^l ' ' ' ^n-l ■ ■ ■ ^0^1 ' ' ' ■5n-l> (2-2) 

and the encoding of . . . jV^i^'"^^) o.s a bit string as in clause\E of Definition \2.4\ results 

with 

^0^0 ' ' ' ^O^l^l • • • Sl ■ • • Sn-l'^n-l ' • • Sji-l- (2-3) 

Proof. Let us show how to construct the formulas (/>i(x, X) . . . ,(pk(x, X). The idea simply 
is that 

A^UJ,A) 

should hold if the bit in position jk + i from the left in (|2.2|) is 1 if and only jk + i = 
(c— l)n + r and r — 1 € Ac, where 0<j<n — 1, l<i</c, l<c</c, and 1 < r < n. This 
condition can be easily expressed in F0(+, x). The formulas i^iix, X) . . . , tpki^, X) can be 
constructed completely analogously. 

Note also that the formulas (/'i(x, X), . . . , (pkix, X) define a permutation of /c-tuples of 
unary relations and that il^ii^x, X), . . . , ipki^, X) define the inverse of this permutation. □ 

Proposition 2.7. For any L, mon-(5]^FO(+, x) = mon-(5^FO(+, x). 

Proof, by Lemma [2?6l Q\X \ipi{X), . . . , ips^i{X)\ can be expressed as 

Qlx[pi{Xi/ilji{X), . . . ,X,./V^fc(X)), . . . , . . . . 

Analogously, Q\X . . . , i.p s-i{X)\ can be expressed as 

QlX[ipi{Xi/ct)i{X), . . . ,Xfc/0fc(X)), . . . , ^,_i(Xi/(/>i(X), . . . ,X,,/0fc(X))] . □ 

By Proposition 12.71 the two semantics of second-order quantifiers coincide for all the logics 
(with built-in or definable arithmetic) considered in this article. 

Remark 2.8. Let L and A be as in Definition 12.41 It is worth noting that, for m > 1, the 
m-ary second-order quantifiers Q\ and Q*j^ can be defined by straightforward modifications 
to Definition 12.41 The m-ary quantifiers binds a /c-tuple X = (Xi, . . . ,Xk) (for some k) 
of m-ary second-order variables in s — 1 many formulas. Each Xi is encoded by the bit 
string So • • • s„m_x with the meaning Sj = 1 if and only if the jth tuple in the lexicographic 
ordering of {0, 1, . . . , n — l}"* is in Xi. The semantics of the m-ary quantifiers Q\ and 
can be now defined analogously to Definition 12.41 We use the notation Q^YO and Q^FO 
for the analogues of mon-Qj^FO and mon-Q^FO in which the m-ary quantifiers Q\ and Q\ 
are allowed for m > 1. 
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We end this section by showing that, in the non- monadic case, the analogue of Propo- 
sition 12.71 holds without built-in arithmetic if L has a neutral letter. 

Proposition 2.9. For any L G X, Q\¥0 = Q\F0. 

Proof. We may assume that L has an alphabet S = (ai, 02, . . . , Og), where is a neutral 
letter. 

We will first show that Q^FO < Q\¥0. The idea of the proof is to show that a formula 
ip G Q*^0 can be replaced by a formula tp' G Q^FO in which only one second-order variable 
with higher arity is quantified. Now, in ■0', the quantifier Q\ can be replaced by Q\ since 
the difference of the two semantics only appears if more than one variable is quantified. 
The converse inclusion follows directly from the fact that Q\Y0 < Leaf^(L) = Q^FO (see 
Theorem EH]). 

Let -0 G QIFO be of the 

:= QlX[<f,(X), V2(X),..., ^s~i(X)] , 

where X = {Xi, . . . , X^) is a tuple of m-ary second-order variables. It is straightforward to 
construct a formula ip' G Q^FO 

i;' := QlRi [f[{Ri), (/^'al^i), • • • , ^'s-iiRi)] , 

where the arity of Ri is m + [log(fc)J -|- 1, which is equivalent to ip over structures A with 
|dom(^)| > 2. Let A be a structure such that |dom(yi)| > 2 and Ai C dom(yi)™'. The idea 
is to encode the tuple A = (Ai, . . . , Af^) by a unique (m -|- [log(A:)J -|- l)-ary relation B-^ 

%= U {(ii'---'/Liog(fc)J+i)} 

l<i<fc 

where Jl " " " j[iog(fc)j+i length [log(A;)J -|- 1 binary representation of i. This ensures 

that the ordering of the tuples A (see Definition 12.41 and Remark 12. 8p coincides with the 
ordering of the corresponding codes B^. Therefore, it suffices to construct the formulas 
(/9-(i?i) in such a way that, for all Ai, . . . , A/. C dom(yi)'" 

A^^'iiB^) ^ A^^,{Ai,...,Ak), 

and, ii B B^ for all A, then A Y= ^[{B) implying that the formulas ip'^{Ri) output the 
neutral letter when Ri is interpreted by the relation B. 

In order to ensure that V ^-^d ip' are equivalent also over structures A for which 
|dom(yi)| = 1, we may further replace the formulas (p[{Ri) by formulas ip*{Ri,...,Rk), 
where each Ri, for 2 < i < fc, is also (m -|- [log(A;)J -|- l)-ary and ip*{Ri, . . . , Rk) has the 
following form 

i\dom{A)\ = lAXiiRi,...,Rk))yi\dom{A)\>lA /\ i?, = A V9^(i?i))), 

2<i<k 

where Xi simulates the behavior of (pi on structures with cardinality 1 (on structures with 
cardinality 1 the quantifiers Q\ and are equivalent). Note that, for A with |dom(yi)| > 2, 
the formulas (p*{Ri, . . . ,Rk) output the neutral letter if Rf ^ for some 2 < i < k. It 
follows that for all A 

A^^p^A^ Q\R[pl{R),ipl{R), ^:_,(R)] . 
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For the converse, it suffices to note tliat a polynomial-time non-deterministic Turing 
machine with the leaf language L can easily evaluate sentences of Q^YO implying that 
Q^FO < LeafP(L) (see [9j). Therefore, by Theorem (HD we get that Q\YO < QIFO. □ 

2.2. Leaf languages. In this section we give a brief introduction to the leaf languages 
approach in computational complexity. 

The leaf languages approach was introduced by Bovet, Crescenzi and Silvestri in [6j and 
independently by Vereshchagin in [28] . In this approach the acceptance of a word input to 
a nondeterministic Turing machine depends only on the values printed at the leaves of the 
computation tree. 

Let M be a nondeterministic Turing machine which halts on every computation path 
with some order on the nondeterministic choices. The order of the nondeterministic choices 
induces a left-to-right ordering of all the leaves in the computation tree of M on input x. 
Define leafstring*^(x) to be the concatenation of the symbols printed at the leaves of the 
computation tree in this order. Given now a language B, the class Leaf^(i?) contains those 
languages L for which there is a polynomial-time non-deterministic Turing machine M such 
that for all inputs x: a; € -L iff leafstring^^(a;) € B. 

Let us look at some examples. Define Maj := {w € {0, 1}^| > |w|o}. 

Example 2.10. The following leaf language classes are well known: 

• NP = LeafP(0*l(0 + l)*), 

• PP = Leaf P (Maj), 

. ModqP = LeafP(Lmodg). 

In [S] complexity classes defined by leaf languages were logically characterized in terms of 
generalized second-order quantifiers. In particular, for every language B that has a neutral 
letter the following was shown to hold. 

Theorem 2.11 ([9^). For any S € X, Lea{^{B) = Q^^FO. 

Note that, for Theorem l2.11l to hold, the quantifier must be allowed to bind relation 
variables of arbitrary arity (see Remark 12. 8p . Although the m-ary second-order quantifiers 

and differ, in Theorem 12.111 we can equivalently use the semantics instead of 
Q*^ by Proposition 12.91 

Since it is known that there are regular languages B, e.g., the word-problem for the 
group 55, for which Leaf^{B) = PSPACE [l3], we conclude that for such B, 

Q^FO = PSPACE. 



2.3. Finite leaf automata. The automata theoretic analogue of a Turing machine with a 
leaf language is the so-called finite leaf automaton ^2j . 

A finite leaf automaton is a tuple M = {Q, S, 5, s, F, /?) where Q is a finite set of states, 
S is an alphabet, the input alphabet, 6: Q x Ti ^ is the transition function, s € Q 
is the initial state, F is an alphabet, the leaf alphabet, and /3: Q ^ F is a function that 
associates a state q with its value f3{q). The sequence 6{q, a), for q G Q and a G S, contains 
all possible successor states of M when reading letter a while in state q, and the order of 
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letters in that sequence defines a total order on these successor states. This definition allows 
the same state to appear more than once as a successor in 5{q, a). 

Let M be as above. The computation tree Tm{w) of M on input w is a labeled directed 
rooted tree defined as follows: 

• The root of Tm{w) is labeled {s,w). 

• Let u be a node in Tm{w) labeled by {q,x), where x ^ e (the empty word), x = ay for 
a € S, y G S*. Let 5{q, a) = qiq2 ■ ■ ■ qk- Then v has k children in Tm{w), and these are 
labeled by {qi,y), {q2,y), ■ ■ ■ , {qk,y) in this order. 

If we look at the tree Tm{w) and attach the symbol /3(g) to a leaf in this tree with label 
{q,e), then leafstring'^(7i;) is defined to be the string of symbols attached to the leaves, read 
from left to right in the order induced by 5. 

Definition 2.12. For A C F*, the class Leaf^^(^) consists of ah languages B C E*, for 
which there is a leaf automaton M as just defined, with input alphabet S and leaf alphabet 
r such that for all tt; G S*, t/; G S iff leafstring*^(t(;) G A. If C is a class of languages then 
Leaf^^(C7) = UAGcLeaf^^(A). 

In [22] the acceptance power of leaf automata with different kinds of leaf languages was 
examined. It was shown that, with respect to resource-bounded leaf language classes, there 
is not much difference, e.g., between automata and Turing machines. On the other hand, 
if the leaf language class is a formal language class then the differences can be huge. In 
particular, it was shown that 

Leaf (REG) = REG, 

while it is known that 

LeafP(REG) = PSPACE. 
In |22] the power of Leaf^^(CFL) was left as an open question. The only upper and lower 
bounds known at that time were GEL C Leaf''^(GEL) C DSPAGE(n2) n DTIME(2'^(")) . 
Recently it was shown by Lohrey |18] that indeed Leaf (GEL) does contain a PSPAGE- 
complete language. 

In [llj the class Leaf^^(L) was logically characterized assuming that the language L 
has a neutral letter. 

Theorem 2.13 (dU). For any L G X, Leaf'''^(L) = uiou-Q\FO. 

Corollary 2.14. PSPAGE-compZeie languages can he defined in mon-QQj.pEO. 

Proof. By the result of |T8], there is a language L G GEL such that the class Leaf^^(-L) con- 
tains a PSPAGE-complete language. Since L reduces via a length-preserving homomorphism 
to some groupoid word-problem ^ [5], it follows that also the class Leaf^^(yl) = mon-Q^EO 
contains a PSPAGE-complete language. □ 

2.4. Complexity theory. We assume familiarity with the basic notions in formal lan- 
guages and complexity theory, e.g., complexity classes such as NP, PP, PH, and PSPAGE. 
REG and GEL refer to the regular and context-free languages. Also, LOGGEL denotes the 
closure of GEL under log-space reductions. 

In this article AG°, AGG°, TG°, NG\ and SAG^ refer to the classes of languages 
recognized by DLOGTIME- uniform families {Cn)nen of polynomial-size circuits with the 
following kinds of gates: 
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AC : the circuit Cn may have NOT, unbounded fan-in AND and OR gates, and constant 
depth. 

ACC°: the circuit C„ may have NOT, unbounded fan-in AND, OR and MOD^ gates, and 
constant depth. 

TC'': the circuit Cn may have NOT, unbounded fan-in AND, OR, and MAJORITY gates, 
and constant depth. 

NC^: the circuit C„ may have NOT, bounded fan-in AND and OR gates, and 0(log(n)) 
depth. 

SAC^: the circuit C„ may have input level NOT gates, bounded fan-in AND and unbounded 
fan-in OR gates, and 0(log(n)) depth. 

The requirement of DLOGTIME-uniformity means that {Cn)n&n-, as a family of directed 
acyclic graphs, can be recognized by a deterministic Turing machine, with random access 
to its input, in time 0(log(n)) (see [29] for details). Note that, e.g., the classes AC* and 
TC* are defined analogously as above but allowing 0(log*(n)) circuit-depth. 

In this article we also discuss certain complexity classes defined in terms of alternat- 
ing Turing machines. We denote by ATIME(t(n)), the class of languages which can be 
recognized in time t[n) by some alternating Turing machine. 

Let M be an alternating Turing machine accepting x and denote by T the computation 
tree produced by M with input x. An accepting computation subtree 5 of M on input x is 
a subtree of T witnessing that M accepts x. The idea is that all the nodes in S must be 
accepting configurations, and, furthermore, S must contain the initial configuration, i.e., 
the root of T, all successors of universal configurations, and exactly one successor of each 
existential configuration. 

We say that an alternating machine M is tree-size hounded by t: N — > N if for every x 
accepted by M there is an accepting computation subtree of M on input x which has at 
most t{\x\) nodes. Let now 

ASPACE-TREESIZE(s(n), t(n)) 

denote the class of languages which can be recognized by an alternating Turing machine M 
which is space bounded by s and tree-size bounded by t. 

The following (non-trivial) inclusions and equalities are known to hold among the classes 
defined above: 

TC° C NC^ = ATIME(log(n)) C SAC^ = LOGCFL 

= ASPACE-TREESIZE(log(n),n^(i)). 

The last two equalities where shown by Venkateswaran [27] and Ruzzo [23], respectively. 

3. GrOUPOIDAL QUANTIFIERS 

In this section we consider second-order monadic groupoidal quantifiers. We show that the 
extension of SOM in terms of all second-order monadic groupoidal quantifiers collapses in 
expressive power to its fragment mon-(5Qj,pFO (respectively to mon-QQj.pFO(-|-, x)). 

The following result on first-order groupoidal quantifiers will be central for our reason- 
ing. Below, QF refers to the set of quantifier-free formulas (of suitable signature) in which 
the predicates -|- and x do not appear. 

Theorem 3.1 ([l6]). QcrpQF = FO(QGrp) = FO(QGrp,+, x) = LOGCFL. 
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We shall use the following version of Theorem 13.11 

Lemma 3.2. Let r = {ci,...,Cs}, where ci,...,Cs are constant symbols. Then on t- 
structures 

QcrpQF = FO(QGrp) = FO(QGrp, +, x). 

Proof. The idea of the proof is to translate G FO(QGrp, +, x ) [t] into ip* G FO((5Grp5 +i x ) 
of a suitable string signature using a simple encoding of r-structures into strings. By 
Theorem 13.11 we may then replace (p* by an equivalent formula 9 G QGrpQF. Finally, we 
show that 6 can be translated back to a formula 6* G (5GrpQF[r] in such a way that 9* and 
(p are equivalent. 

Suppose that K is a class of r-structures definable by (^9 G FO((5Grp5+5 x). We shall 
encode K as a class of strings over signature {Ps^ , • • • , Ps2s ) j where Si, ... , is some fixed 
enumeration of the subsets of {ci, . . . , Cg}. We associate every r-structure A with a unique 
string wji over the same universe in the following way. For S Q {ci, . . . , c^}, define 

= {6 I cf = 6 ^ Q G 5}. 

Note that the predicate Pfn is interpreted by the set {0, . . . , n — 1} \ {cf , . . . , } where 
{0, . . . , n — 1} is the universe of A. 

Let if* be acquired from 99 by replacing atomic subformulas of the form q = t by 
yciGsPsit) and Ci = Cj by the formula 3y(\/c^^Cj£sPs{y))- It is now obvious how to translate 
atomic formulas using the predicates +, x, and <, e.g., the formula q < t is replaced 
by 3y{\/ciesPs{y)) ^ V < t). It is easy to show using induction on the construction of 
if G FO(QGrp) +; x) that for all r-structures A, 

By Theorem 13.11 there is a sentence 9 G QGrpQF which is equivalent to ip* over strings. Let 
9* be acquired from 9 by replacing subformulas Ps{t) by 

{|\ci = t)^{ l\ cj + 1). 

c,es Cje{ci,...,cs}\s 
Again by induction on G QGrpQF we get that for all r-structures A, 

It fohows that 61* G ^GrpQF defines K. □ 

We are now ready for the main result of this section. Note that the built-in predicates 
+ and X are definable already in terms of (first-order) majority quantifiers (see [1] and |16j ) 
and hence definable in the logics in which groupoidal quantifies are allowed to be nested. 

Theorem 3.3. SOM(mon-Q^^p) = mon-Q^^pFO. 

Proof. Fix a signature r = (P^i , ■ ■ ■ , Pas)- Suppose that ^p G SOM(mon-Q^j,p) is a sentence. 
We will show how to construct a sentence of the logic mon-^GrpFO equivalent to ip. The 
idea of the proof is to represent ip G SOM(mon-(5Qj,p), and the language of signature r 
defined by ip, in terms of ip* G F0((5Grp, + , x), and the class of fi-structures defined by ip* , 
where a = {ci , . . . ,Cs} and constant symbols. More precisely, by representing 

r-structures of cardinality n by cr-structures of cardinality 2", we can replace second-order 
variables over the domain {0, . . . , n — 1} by first-order variables ranging over {0, . . . , 2" — 1} 
using the BIT-predicate. Then we apply Lemma[32]to get a formula 9 G QGrpQF equivalent 
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to if* . Finally, we show that 9 can be translated back to a formula 9' G mon-(5Qj,pFO [r] in 
such a way that the original formula ip and 9' are equivalent. 

Denote by a = {ci, . . . ,Cs} the signature where each Cj is a constant symbol. For a 
r-structure A = ({0, . . . , n — 1}, <, , • • • , -P^), let A* be the following cr-structure 

A* = ({0,...,2"-l},<,+,x,cf ,...,cf ), 

where c^* is the unique integer (< 2") whose binary representation is given by sq • • • Sn-i 
where Sj = 1 <^=^ j G P^. 

We shall first show that there is a sentence (p* € F0((5Grpj+) x)[c] such that for all 
r-structures A, 

A^^^A*^ip*. (3.1) 
The translation (/9 (^* is defined inductively as follows. For ip of the form X i — 3j j or 
Xi < Xj, (p* := ip, and in the remaining cases (we may exclude the definable constants 
min, max, and the second-order existential quantifier from SOM(mon-QQj.p) since Q'^^ is 
available) the translation is defined in the following way: 

Pa^ixi) ^ mT{ci,n-{xi + l)) 
Yi{xj) BlT{yi,n-{xj + l)) 

il) Acf) ~^ Ip* A(f)* 

-nip -1-0* 

3xi^p 3xi{xi < n A 'ip* (xi)) 

Q*LYi,...,Yk[ipi,...,ips-i] QLyi,---,yk[i^l,---,i^*s-i] 

It is straightforward to show using induction on the construction of € SOM(mon-QQj.p)[r], 
that for all r-structures A and assignments s, 

A\=sip^A* ^s* 

where the assignment s* over A* is defined such that s*{xi) = s{xi), for all first-order 
variables Xi, and, for a second-order Yi: if s{Yi) = A C {0, ...,n — 1}, then s*{yi) is 
the unique a < 2" whose length n binary representation is given by sq • • • Sn-i where 
Sj = l ^ j G A. 

Above, we use the predicate BIT which is definable in F0(-|-, x) (see, e.g., [13|). Note 
also that, using the predicate BIT, the integer n can be easily defined over the structure 
A*. 

By Lemma 13.2^ there is a sentence 

9 = QlXi,.. .,xi[xi,-- ■,Xw], 

where each Xi is quantifier-free and does not contain the predicates -|- and x , equivalent to 
ip*. The idea is now to translate 9 into 9' G mon-QQj.pFO by changing first-order variables 
to second-order variables. Denote hy X = Y the formula \/z(X{z) -(-^ Y(z)), and by X < y 
the first-order formula defining the ordering of subsets when treated as length n binary 
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strings. The translation 6 9' is now defined by 

t = i Xt = 
t <i Xt< X^ 
■0 A ~^ ip' A(j)' 

Qlxi, . . . . . . ,V'i>] --^ Ql^i, - ■ ■ . . ,11^'^] 

Above, t is either min, max, q, for 1 < / < s, or a variable x, and, respectively, Xt- is either 
_L, T, or X. A straightforward induction implies, in particular, that for all sentences 
"0 £ QGrpQF[o"], and r-structures A 

where A* is defined as above. It is now immediate that 0' and the original sentence € 
SOM(mon-QQj,p)[r] are equivalent. □ 

Note that, by Proposition 12.71 we do not need to consider the semantics Q\ separately. 
By combining Theorem 13.31 and Proposition 12.71 we get 

Corollary 3.4. 

SOM(mon-Qj.,p) = mon-Q^,pFO(+, x) 

= mon-Q^^pFO = SOM(mon-Q^^p). 

We close this section by showing that a much stronger analogue of Corollary 13.41 holds. 
Recall that the so-called Greibach's hardest context-free language H \s a, nondeterministic 
version of the Dyck language D2, the language of all syntactically correct sequences con- 
sisting of letters for two types of parentheses. It is known that every L G CFL reduces to 
H under some homomorphism [12]. It was shown in [16] that the statement of Theorem 13. II 
remains valid even if the logic QcrpQF is replaced by the logic Qpad{_f/)QF, where pad(i?) 
is H extended by a neutral letter. This result directly implies the following strengthening 
of Corollary 13.41 



Theorem 3.5. 

SOM(mon-Q^,p) ^ mon-Qi,d(^)FO(+, x) 

^ mon-Q;^d(^)FO ^ SOM(mon-Q^,p). 

Proof. The proof is analogous to the proof of Theorem 13.31 It suffices to prove the last 
equality in the statement of the theorem. Suppose that ip € SOM(mon-(5Qj,p) is a sentence. 
By an analogous argument as in the proof of Theorem [331 we first translate ip into a sentence 
(p* € F0((5Grpi +,><)• Then we replace ip* by an equivalent sentence 6 G <5pad(_H')QF- Now, 
again by an analogous argument as in the proof of Theorem 13.31 ^ can be translated back 
to the logic mon-Qpg^jj-^jFO. □ 
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4. MONOIDAL QUANTIFIERS 

In this section we consider second-order monadic monoidal quantifiers. 

As already mentioned, the fohowing result completely characterizes the picture in the 
case of the semantics without built-in arithmetic. 

Theorem 4.1 ([11]). mon-Qij^^FO = SOM(mon-Qij^„) = REG = 3S0M. 

Interestingly, the expressive power of monoidal quantifiers collapses to regular languages 
when built-in arithmetic is not present. Under reasonable complexity theoretic assumptions, 
the corresponding equalities between the logics in Theorem 14.11 do not hold with built-in 
arithmetic. Furthermore, it is an open question if the analogue of Theorem 14.11 holds with 
respect to the semantics Q^. Again, by Proposition l2.71 we however know that the semantics 
coincide assuming built-in arithmetic. 

Theorem 4.2. The following equivalences hold 

(1) mon-Qi,„^FO(+, x) = mon-QJ;,„^FO(+, x), 

(2) SOM(mon-gi,„^,+, x) = SOM(mon-QJ;,„„, +, x). 

Note that also in equivalence [2] of Theorem 14.21 the arithmetic predicates (in fact -|- would 
also suffice) need to be assumed by Theorem 14. 1^ i.e., -|- and x are not definable in 
SOM(mon-Qj^Qj^). It is an open question whether the equivalences of Theorem 14.21 hold 
without built-in arithmetic. 

5. Complexity results 

In this section we study the data complexity of the logics discussed in the previous sections. 

We begin with a simple logical padding argument which allows us to utilize Theorem 
12.111 in the context of second-order monadic quantifiers. Recall that, in the statement of 
Theorem 12. 11^ the quantifier is allowed to bind relation variables of arbitrary arity (see 
Remark 12. 8p . Below, we do not distinguish notationally between a string w of alphabet 
(oi, . . . , Os), and the string structure of signature (Pai , • • • , Pas) corresponding to w. 

Proposition 5.1. Let L be a language and suppose that a language A of alphabet E is 
definable by a sentence f € Q*^FO. Let k be the arity of the relations quantified in ip and 
tJ ^ S. Then the language 

A* = {w^^\'"\'~\'"\ \weA} 
is definable in FO(mon-(5J^, -|-, x). 

Proof. Let 99 be of the form 

Qii?i,...,i?t[V'i,...,V's], 
where each of the relations Ri has arity k. Define a translation ip ip* as follows. For ip of 
the form Xi = Xj, Xi < Xj, or P^. (xj), if* := ip, and in the remaining cases (we exclude the 
definable constants min and max) the translation is defined in the following way: 

Ri{xi,. . . ,Xk) 3z{Xniiz) Az = \w\''^^xi-\ \- \w\xk-i + Xk) 

Ip A(j) Ip* A(l)* 

-nip -W —1^* 

Bxip 3x{x < \w\ A^*{x)) 

QlRi,...,Rt[^Pi,...,^Ps-i] - QlXR„...,XnAri, ■■■,€-!] 
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It is straightforward to show using induction on -0 £ Ql^O that for ah w and assignments 
s 

where s* agrees with s with respect to first-order variables, and 

s*{Xr^) = {a \ a = \w\'''^bi H \- \w\bk-i + bk for some {bi,. . . ,bk) G s{Ri)}. 

Note also that, by our conventions, the universe of t^'^jjl"'!''"!"'! is {0, . . . — 1} hence there 
is 1-1 correspondence between the subsets of {0, . . . — 1} and the k-ary relations over 
the universe, {0, . . . , n — 1}, of w. 

Finally, the language A* is defined by 99* A where x € F0(+, x) and 

Proposition 15.11 shows that logics FO(mon-(5^, +, x) can be quite powerful. In fact, it is 
apparent from the proof that if, e.g., ip G Q^FO in the proof of Proposition 15.11 defines a 
PSPACE-complete language, then the language defined by (p* E mon-(5^FO(+, x) is also 
PSPACE-complete. 

Corollary 5.2. In the logic mon-Ql^^^FO{+, x), PSPACE-complete languages can be de- 
fined. 

Proof. This follows, e.g., by the fact that 

Q^FO = PSPACE, 

where B is the word-problem for the group (see Section [2. 2p . and by Proposition 12.71 □ 

Recall that in the case of groupoidal quantifiers, already in mon-QQj.^FO PSPACE-complete 
languages can be defined by Corollary 12.141 

Next we show that the logics SOM(mon-QJ^Qj^, +, x) and SOM(mon-QQj.p) capture the 
exponential versions of the language classes captured by the logics FO((5Mon;+, x) and 
F0(QGrp5+, X)- As already noted in Theorem 13. H the logic F0((5Grp;+, x) corresponds 
to LOGCFL [TB]. On the other hand, in [4J it was show that 

FO(QMon,+, x) = NC^ = ATIME(log(n)). (5.1) 

Definition 5.3. For n G N, denote by bin(n) the binary representation of n without leading 
zeros. Let L C {0, 1}"^ and IL = {Iw | w & L}. Define now tally (L) as 

tally(L) = {1" I bin(n) G IL}. 

Let us now define the classes of languages 2^TiME(iog{n)) 2LOGCFL 

2ATiME(iog{n)) = c {0, 1}+| tally(L) e ATIME(log(n))}, 
2LOGCFL = c {0, 1}+| tally(L) G LOGCFL}. 
The following is easily seen to hold: 

Proposition 5.4. The following equalities hold 

(1) 2ATIME(log{n)) ^ ATIME(n), 

(2) 2LOGCFL ^ AgpAQE-TREESIZE(n,2^(")). 
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Proof. The first equality is obvious and the second follows from Ruzzo's characterization of 
LOGCFL: 

LOGCFL = ASPACE-TREESIZE(log(n),n^(i)) 
(see p3] and [29]). □ 

Remark 5.5. By the above, we immediately get that 

NSPACE(n) C 2LOGCFL^ 

It is also straightforward to show that 2^^*-''-'^^ includes the languages that can be recog- 
nized in linear time on a Threshold Turing machine (introduced in |21j). 

The main result of this section can be now stated as follows: 

Theorem 5.6. The following equivalences hold 

(1) SOM(mon-QJ;,„^,+, x) = ATIME(n), 

(2) SOM(mon-Q^^p) = 2L0GCFL_ 

Proof. We will prove equivalence [2j Equivalence [1] is proved analogously using the fact that 
ATIME(log(n)) = FO(gMon,+, x) (see ^J^). 

We win show that, for ah L C {0, 1}+, L is definable in SOM(mon-Q^j,p) iff tally (L) G 
LOGCFL. Since LOGCFL = FO(QGrp,+, x), it suffices to show that for ah L C {0,1}+, 
L is definable in SOM(mon-QQj,p) iff tally(L) is definable in FO((5Grp;+5 x). 

We will first show that if L is definable in SOM(mon-(5Qi.p), then tally (L) can be defined 
in F0((5Grp; +) x). The idea is now to translate formulas between string structures 

«; = ({0,...,m -!},<, Pi, Po), (5.2) 

and 

1" = ({0,...,n-l},Pi,<,+,x), (5.3) 

where w € {1, 0}+, bin(n) = Iw, and Pi = {0, . . . , n — 1}. Some technical difficulties arise 
here, which were not encountered in the proof of Theorem 13.31 due to the fact that the sizes 
of the universes of w and 1" are not necessarily exactly of the form I and 2' for some 

We define a translation (/? of € SOM(mon-(5Qj,p) into ip* G FO{QGrp,+, x) 

below. Analogously to the proof of Theorem \'6.'6\ it can be shown using induction on 
if G SOM(mon-(5Qj.p), that for all w and assignments s, 

The assignment s* is defined so that it agrees with s on first-order variables Xi, and, for 
a variable yi, corresponding to a second-order variable Yi, s*{yi) = a < 2™, where a = 
S^o^Si2™-i-^ and = 1 iff i G s{Yi}. 

The translation 99 ~^ 99* is defined inductively as follows. For ip of the form Xj = xj 
or Xi < Xj, if* := 93, and in the remaining cases (again, we exclude the definable constants 
min, max, and the second-order existential quantifier from SOM(mon-(5Qfp) since Q*j^^ is 
available) the translation is defined in the following way (recall that by (|5.2p and ()5.3p we 
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have [log(n)J = m = \w\): 

Pi{xi) ^ BIT(n, [log(n)J - (x, + 1)) 

Po(xi) ^ -BIT(n, Llog(n)J - {xi + 1)) 

Yi{xj) ^ BIT(y,, Llog(n)J - (x,- + 1)) 

tl) A(f) -w tl)* A(f)* 

3xiip 3xi{xi < [log{n)\ A'4>*{xi)) 

Note that, e.g., the formula BIT(n, [log(n)J — (xj + 1)) above can be easily constructed even 
though the integer n is not in the universe of the structure 1" . Without loss of generality, 
we may assume that the letter ag in the alphabet (01,02, .. . ,as) of L is a neutral letter. 
Now the formula Xi used to translate Q^, ensures that the interpretation bi,...,bk G 
{0, . . . ,n — 1} of the tuple yi, . . . ,yk does correspond to some tuple of unary relations 
Bi, . . . ,Bk C {0, . . . , m — 1}. The problem is that there can be more tuples b than tuples 
B. The formula % is defined as 

/\ < 2Li°gHJ - 1. 
i<«<fe 

Now if X is not satisfied by b, then none of the formulas x A ip* will be satisfied and hence 
these formulas produce the neutral letter o^ when y is interpreted as b. 

We conclude that, by the above, if L is defined by a sentence ip G SOM(mon-QQj.p), 
then the sentence 

(</^* AVxPi(x)) GFO(QGrp,+,x), 

defines tally (L). 

We will next define a formula translation if ^ ip' mapping ip € F0((3Grp)+, x) into 
if' G SOM(mon-(5Qj.p). Again, an analogous induction on the construction of (p shows, in 
particular, that for all sentences (p and all n G N, 

bin(n) ^ ip' ^ l"" \= ip. 

The translation ip ip' is defined by replacing first-order variables by unary second-order 
variables: 

X = y X = Y 
X <y X <Y 
x + y = z X + Y = Z 

X X y = z ~^ X xY = Z 
Pi{x) T 

ip A(f) ~^ ip' Acf)' 
—lip —1^' 
3x^ 3X{6Aip') 
Qlxi, . . . ,xv[ipi, . . . ,ip^] QlXi, . . . ,X^[5 Aip'i, . . . ,5 Aip'^] 

Above, X = Y denotes the formula \/z{X{z) o Y{z)). Also X < Y, X + Y = Z, and 
X X Y = Z are formulas defining the ordering, addition, and multiplication of unary 
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relations, when treated as binary strings. Finally, the formula 5 is simply 

6 = X <Pi, 

and it has an analogous role here as x had above. Again, we may assume that is a neutral 
letter of L when translating the quantifier Ql. 

By the above, it holds that for all L C {0,1}+: if tally (L) G LOGCFL, then IL is 
definable in SOM(mon-QQj.p). In order to complete the proof, it suffices to show that L 
is definable in SOM(mon-(5Qj,p) iff IL is definable in SOM(mon-(5Qj,p). Note that on the 
computational side, L and IL are easily definable from each other. On the logical side, it 
follows from the fact that SOM(mon-QQj.p) is closed under logical reductions for which the 
target structure w* has size linear in w. The idea is that if \w*\ = k\w\ then subsets of w* 
can be encoded by k subsets over w and, hence, such a formula translation can be defined 
in terms of second-order monadic quantifiers. This implied, in particular, that for any 
sentence ip € SOM(mon-QQj,p) we can construct a sentence (p* G SOM(mon-(5Qj.p) which 
holds over w iS Iw \= (p. We do not give the proof here but refer to the proof of Corollary 
8.6 in [15] in which an analogous result is proved for the extension of FO in terms of the 
second-order monadic majority quantifier. □ 

Finally, we turn to the case of symmetric (commutative) languages. It is obvious that, 
for a symmetric language L, the quantifiers and Q2 equivalent. Let CFL* and REG^ 
denote the classes of symmetric context-free and regular languages, respectively. 

Denote by Mod-LinH the linear analogue of the class Mod-PH. Recall that Mod-PH is 
the oracle hierarchy, analogous to the polynomial hierarchy PH, in which the building block 
of the hierarchy is NP U (ModqP)q>i and the {k + l)th level is defined by allowing access 
to oracles from the Boolean closure of the kth level. Similarly, we denote by Lin-CH the 
linear analogue of the counting hierarchy CH, which is the oracle hierarchy with PP as the 
building block. 

The expressive power of second-order monadic quantifiers defined by symmetric regular 
and context-free languages can be characterized as follows: 

Theorem 5.7. The following equivalences hold 

(1) SOM(mon-Q^j5G«, -h, x) = Mod-LinH, 

(2) SOM(mon-g^pLs) = Lin-CH. 

Proof. Let us first show equivalence El Note that by Parikh's theorem on context-free 
languages, every symmetric context-free language is already in TC'' and, by [4], TC'^ = 
F0((5Maj) +, X)- Therefore, we get that 

SOM(mon-Q£FL0 = SOM(mon-Qj^^j) 

by an analogous argument as in Theorem 13.31 In [15] it was shown that 

SOM(mon-gj^^j) = Lin-CH, 

hence the claim follows. 

For equivalence dl note that F0(Qreg'';+5 ><) = ACC° (see [1]) and that, analogously 
to AC° and PH, ACC° is the logarithmic analogue of Mod-PH (see [21 d]). Hence, by 

"'^More generally, it also implies that SOM(mon-QQjp) captures 2'^°*^'^^'^ over all string signatures, since 
a string w of any signature can be encoded in binary with length 0{\w\). 
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standard padding we get that 2 = Mod-LinH and, mimicking the proof of Theorem 
15.61 it fohows that 

SOM(mon-Q^EG. , +, x ) = Mod-LinH. □ 

We conclude this section by the following table summarising the results on the data- 
complexity of the logics studied in this paper. 



Table 1: Summary of the results 



Logic & built-ins 


{<} 


{+,x} 


Result 


mon-Q^^jjFO 
mon-Q^^jjFO 


REG 
> REG 


PSPACE-comp. 
PSPACE-comp. 


Thm|4JJ[ll|, Cor 15.21 
[7, 26J, CorlO 


SOM(mon-Qi,„J 
SOM(mon-QV) 


REG 
> REG 


ATIME(n) 
ATIME(n) 


Thm|4JJpJJ, ThmlMI 
[7, 26], ThmES 


mon-Q^,pFO 
mon-Q^,pFO 


PSPACE-comp. 

2LOGCFL 


2LOGCFL 


Cor 12.141 [18J, ThmlMI 
Thm[5]6] 


SOM(mon-Q^^p) 
SOM(mon-Q*^p) 


2LOGCFL 


2LOGCl^'L 
2LOGCFL 


Thm 15.61 
Thml^TU] 


SOM(mon-(5^j5Qs) 
SOM(mon-Q*pL0 


REG 
Lin-CH 


Mod-LinH 
Lin-CH 


Thm|4JJlllJ, Thm|5^ 
ThniEZl 



6. Conclusion 

We conclude with two questions for further study. The main open question regarding 
groupoidal quantifiers is to determine whether the two variants of semantics for second- 
order groupoidal quantifiers coincide in the most restricted case studied in this paper, i.e., 
is it the case that 

mon-Qjj,pFO = mon-Q^,pFO? 
A positive answer would imply that 

Leaf^^(CFL) = ASPACE-TREESIZE(n, 2^(")). 

This would strengthen the recent PSPACE-hardness result [18] considerably (showing that 
Leaf^^(CFL) contains PSPACE-complete problems, and answering the open question from 

The second open question concerns the expressive power of the quantifiers Q^, for 
a regular L. It is an open question whether non-regular languages can be defined in 
SOM(mon-gj;,„J. 
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